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HKU1 is a human betacoronavirus that causes mild yet prevalent 
respiratory disease’, and is related to the zoonotic SARS? and 
MERS? betacoronaviruses, which have high fatality rates and 
pandemic potential. Cell tropism and host range is determined 
in part by the coronavirus spike (S) protein*, which binds cellular 
receptors and mediates membrane fusion. As the largest known 
class I fusion protein, its size and extensive glycosylation have 
hindered structural studies of the full ectodomain, thus preventing 
a molecular understanding of its function and limiting development 
of effective interventions. Here we present the 4.0 A resolution 
structure of the trimeric HKU1 S protein determined using single- 
particle cryo-electron microscopy. In the pre-fusion conformation, 
the receptor-binding subunits, $1, rest above the fusion-mediating 
subunits, $2, preventing their conformational rearrangement. 
Surprisingly, the $1 C-terminal domains are interdigitated and form 
extensive quaternary interactions that occlude surfaces known in 
other coronaviruses to bind protein receptors. These features, along 
with the location of the two protease sites known to be important for 
coronavirus entry, provide a structural basis to support a model of 
membrane fusion mediated by progressive S protein destabilization 
through receptor binding and proteolytic cleavage. These studies 
should also serve as a foundation for the structure-based design of 
betacoronavirus vaccine immunogens. 

Betacoronavirus S proteins are processed into S1 and S2 subunits 
by host proteases’. Like other class I viral fusion proteins, the two 
subunits trimerize and fold into a metastable pre-fusion conforma- 
tion. The S1 subunit is responsible for receptor binding, while the S2 
subunit mediates membrane fusion. Coronaviruses typically possess 
two domains within S1 capable of binding to host receptors: an amino 
(N)-terminal domain (NTD) and a carboxy (C)-terminal domain 
(CTD), with the latter recognizing protein receptors for SARS-CoV 
and MERS-CoV®”. Although these individual domains have been 
structurally characterized, the organization of the complete spike has 
not yet been determined, preventing a mechanistic understanding of 
S protein function. 

Here, we present the structure of the HKU1 S protein ectodomain 
determined using cryo-electron microscopy (cryo-EM) to 4.0 A res- 
olution (Fig. 1a and Extended Data Figs 1 and 2 and Extended Data 
Table 1). The protein construct contains a C-terminal T4 fibritin tri- 
merization motif and a mutated S1/S2 furin-cleavage site (Extended 
Data Fig. 3). The S1 subunit adopts an extended conformation with 
short linkers between domains and sub-domains (Fig. 1b). The SI NTD 
(amino acids 14-297) has strong structural and sequence homology to 
the bovine coronavirus (BCoV) $1 NTD (Extended Data Fig. 4), which 
recognizes acetylated sialic acids on glycosylated cell-surface receptors®. 
The glycan-binding site in the BCoV S1 NTD is conserved in the HKU1 
S1 NTD and is located at the apex of the trimer, oriented towards target 
cells. Indeed, HKU1 S1 was recently shown to bind O-acetylated sialic 


acids on host cells, and these glycans were required for efficient infec- 
tion of primary human airway epithelial cultures’. 

The HKU1S1 CTD (amino acids 325-605) consists of a structurally 
conserved core connected to a large, variable loop (HKU1 S amino 
acids 428-587)!” that is partially disordered (Extended Data Figs 5 
and 6). The CTD is located at the trimer apex close to the threefold 
axis, and the core interacts with the other two $1 CTD cores and with 
one NTD from an adjacent protomer. The domain swapping between 
protomers results in a woven appearance when viewed looking down 
towards the viral membrane (Fig. 2a). Structural alignment of the 
SARS-CoV and MERS-CoV CTD-receptor complexes!!!” with the 
HKU1 pre-fusion S protein reveals that the protein-receptor-binding 
surface of the $1 CTD is buried in the HKU1 S protein trimer and is 
therefore incapable of making equivalent interactions without some 
initial breathing and transient exposure of these domains (Fig. 2b). 
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Figure 1 | Structure of the HKU1 pre-fusion spike ectodomain. 

a, A single protomer of the trimeric S protein is shown in cartoon 
representation coloured as a rainbow from the N to C terminus (blue to 
red) with the reconstructed EM density of remaining protomers shown 
in white and grey. b, The S1 subunit is composed of the NTD and CTD 
as well as two sub-domains (SD-1 and SD-2). The $2 subunit contains 
the coronavirus fusion machinery and is primarily a-helical. c, Domain 
architecture of the HKU1 S protein coloured as in a. 
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Figure 2 | Architecture of the HKU1 S1 subunit. a, EM density 
corresponding to each S1 protomer is shown. The putative glycan-binding 
and protein-receptor-binding sites are indicated with dashed shapes on 

the NTD and CTD, respectively. b, The HKU1 S1 CTD forms quaternary 
interactions with an adjacent CTD using a surface similar to that used 

by SARS CTD to bind its receptor, ACE2 (ref. 11). c, Sub-domain 1 is 
composed of amino acid residues before and after the $1 CTD. d, Sub- 
domain 2 is composed of $1 sequence C-terminal to the CTD, a short 
peptide following the NTD, and the N-terminal strand of S2, which follows 
the S1/S2 furin-cleavage site. 


Although a protein receptor has not yet been identified for HKU1, 
antibodies against the CTD, but not those against the NTD, blocked 
HKU1 infection of cells'*. These data suggest that the $1 CTD is the 
primary HKU1 receptor-binding site'*, whereas the NTD mediates 
initial attachment via glycan binding. 

HKU 1 S1 also contains two sub-domains (which we term SD-1 and 
SD-2) that lack significant homology to previously determined struc- 
tures (Fig. 2c, d). These sub-domains are primarily composed of $1 
amino acid sequences following the CTD. However, stretches of amino 
acids preceding the CTD as well as S2 residues adjacent to the S1/S2 
cleavage site also contribute to the sub-domains. This complex folding 
of elements dispersed throughout the primary sequence may allow 
receptor-induced conformational changes in the CTD to be transmit- 
ted to other parts of the structure. 

In contrast to other viral fusion proteins such as influenza haemag- 
glutinin (HA)!* or HIV-1 envelope (Env)!*"°, the HKU1 $1 subunits are 
rotated about the trimeric threefold axis with respect to the S2 subunits, 
causing the $1 subunit from one protomer to sit above the $2 subunit 
of an adjacent protomer (Extended Data Fig. 7). Similar to HA and 
Envy, a region in the HKU1 S1 CTD (amino acids 371-380) caps the S2 
central helix, thereby preventing the fusion machinery from springing 
into action. 

Processing of coronavirus S proteins by host proteases plays a critical 
role in the entry process®. HKU1 S is cleaved by furin into $1 and $2 
subunits during protein biosynthesis. Though mutated in the protein 
construct used here and disordered in the density map, the HKU1 S 
furin-cleavage site at the $1/S2 junction lies in a loop of SD-2 (Fig. 3 
and Extended Data Fig. 6). Furin cleavage would leave a single S2 
8-strand participating in the SD-2 3-sheets (Fig. 2d). Coronavirus S 
proteins also have a secondary cleavage site, termed $2’ (Arg900)°, 
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Figure 3 | HKU1 S2 subunit fusion machinery. a, The HKU1 S2 subunit 
is coloured like a rainbow from the N-terminal 3-strand (blue), which 
participates in $1 sub-domain 2, to the C terminus (red) before HR2. 

b, The HKU1 S82 structure contains the fusion peptide (FP) and a heptad 
repeat (HR1). Protease-recognition sites are indicated within disordered 
regions of the protein (dashed lines). c, A comparison of coronavirus $2 
HR1 in the pre- and post-fusion” conformations. Five HR1 a-helices are 
labelled and coloured like a rainbow from blue to red, N to C terminus, 
respectively. The structures are oriented to position similar portions of the 
central helix (red). 


adjacent to the viral fusion peptide (amino acids 901-918)!” (Fig. 3b 
and Extended Data Fig. 6). This is similar to the multiple endoprote- 
olytic cleavage events that occur in the fusion proteins of respiratory 
syncytial virus (RSV) and Ebola virus'®!’. Protease cleavage at $2’ likely 
follows S1/S2 cleavage and may not occur until host-receptor engage- 
ment at the plasma membrane or viral endocytosis. 

As in all class I viral fusion proteins, the coronavirus $2 subunit con- 
tains the four elements required for membrane fusion: a fusion peptide 
or loop, two heptad repeats (HR1 and HR2), and a transmembrane 
domain'*”°*!, Refolding of HR1 into a long a-helix thrusts the fusion 
peptide into the host-cell membrane, and as the two heptad repeats 
interact to form a coiled-coil, the host and viral membranes are brought 
together. The fusion peptide, conserved among coronavirus S proteins!” 
(Extended Data Fig. 6), is located on the exterior of the HKU1 S pro- 
tein and is adjacent to the putative S2’ cleavage site, which remains 
uncleaved in our structure. The fusion peptide forms a short helix and 
a loop, with most of the hydrophobic amino acids buried in an interface 
with other elements of $2. Unlike influenza HA where the C terminus of 
the fusion peptide is only 14 amino acids away from the N terminus of 
HRI, the fusion peptide of HKU1 S is 60 amino acids away from HRI. 
This span of protein contains four short a-helices and several longer 
regions lacking regular secondary structure. This intervening sequence 
is also buried beneath SD-2 and the S2’ cleavage site, suggesting that 
cleavage may affect the proclivity of S2 for undergoing the transition 
to the post-fusion conformation. 

Coronavirus S protein heptad repeats are unusually large with HR1 
encompassing more than 90 amino acids”°. In the cryo-EM structure, 
HR2 is located at the base of the HKU1 S protein near the viral mem- 
brane, but is poorly ordered, precluding unambiguous assignment of 
the residues. However, HRI is well ordered and arranged along the 
length of the S2 subunit, forming four short helices and part of the 
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Figure 4 | Comparison of structurally related class I viral fusion proteins. The fusion proteins from coronaviruses, influenza virus and HIV-1 are 
cleaved into receptor-binding subunits (pink, light green, light blue) and the viral fusion machinery (dark red, dark green, blue)!*"'®?8, Comparison to 


other class I fusion proteins can be found in Extended Data Fig. 8. 


central three-helix bundle. This arrangement of HR1 is similar to that 
of influenza HA, although in HA the HR1 is organized as two helices 
connected by a long loop!*. Conversion of influenza HA to the post- 
fusion conformation requires these protein elements to transition into 
a single long «-helix”!. The post-fusion six-helix bundle structures of 
SARS-CoV and MERS-CoV 82 heptad repeats”? reveal that corona- 
virus S proteins also undergo a similar transition (Fig. 3c). However, 
the S protein must carry out five such loop-to-helix transitions, high- 
lighting the complexity of S proteins relative to other class I fusion 
proteins. In addition, the membrane distal regions of the pre-fusion S2 
central three-helix bundle (S2 amino acids 1070-1076), which is the 
C-terminal portion of HR1, are splayed outwards from the threefold 
axis (Extended Data Fig. 7). In the available coronavirus post-fusion 
HR1-HR2 structures, this portion of HR1 forms a tight three-helix 
bundle?*?>, Formation of this three-helix bundle may be prevented 
by interactions between the C-terminal end of the S2 HRI and the 
S1 CTD, and thus disruption of these interactions through receptor- 
induced conformational changes would provide an additional means 
by which receptor binding in S1 can initiate S2-mediated membrane 
fusion. Indeed, protease cleavage and an acidic pH are thought to be 
insufficient to trigger the transition to the post-fusion conformation 
without additional destabilization provided by receptor binding”***. 

The formation of anti-parallel six-helix bundles composed of HR1 
and HR2 in the post-fusion conformation is a unifying feature of class I 
viral fusion proteins. However, the pre-fusion conformations of this 
protein family are incredibly diverse in size and topology (Extended 
Data Fig. 8). The HKU1 S protein structure presented here most closely 
resembles influenza virus HA and HIV-1 Env (Fig. 4), which also have 
receptor-binding subunits that cap the central helix of the fusion sub- 
unit!*+!52728 However, some core elements of the fusion machinery are 
conserved amongst all class I fusion proteins, including paramyxovirus 
F proteins. 

The HCoV-HKUIS protein trimer in a pre-fusion conformation is, 
to our knowledge, the largest class I viral fusion glycoprotein structure 
determined to date (Fig. 4 and Extended Data Figs 8 and 9). Since 
betacoronavirus S proteins are similar in size and have a conserved 
domain organization, our findings should be generally applicable 
to other betacoronaviruses, including SARS-CoV and MERS-CoV 
(Extended Data Fig. 6). Our studies provide a structural basis for S pro- 
tein function wherein the pre-fusion S protein is progressively matured 
and destabilized by receptor binding and protease cleavage. Following 
dissociation of the $1 subunits, HR1 would transition to a long «-he- 
lix, and the fusion peptide would be released from the side of the S2 
subunit and inserted into host membranes. The structure and mecha- 
nistic insights presented here should enable engineering of pre-fusion 


120 | NATURE | VOL 531 | 3 MARCH 2016 


stabilized coronavirus S proteins as vaccine immunogens against cur- 
rent and emerging betacoronaviruses, similar to recent efforts for other 
viral fusion proteins”?°. This work also acts as a springboard for future 
studies to define mechanisms of antibody recognition and neutrali- 
zation, which will lead to an improved understanding of coronavirus 
immunity. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The investigators were not blinded to allocation during experiments and outcome 
assessment. 

Protein expression and purification. A mammalian-codon-optimized gene 
encoding HKU1 S (isolate N5, NCBI accession QOZME7) residues 1-1276 with 
a C-terminal T4 fibritin trimerization domain, a HRV3C cleavage site, and a 
6xHis-tag was synthesized and subcloned into the eukaryotic expression vec- 
tor pVRC8400. The S1/S2 furin-recognition site 752-RRKRR-756 was mutated 
to GGSGS to generate the uncleaved construct used for cryoEM studies. Three 
hours after this plasmid was transfected into FreeStyle 293-F cells (Invitrogen), 
kifunensine was added to a final concentration of 51M. FreeStyle 293-F cells area 
high-transfection-efficiency cell line adapted for suspension culture derived from 
low passage clonal cultures and after purchase were not further authenticated. 
Cells were not confirmed to be free of mycoplasma, but were only used for pro- 
tein expression. Cultures were harvested after six days, and protein was purified 
from the medium using Ni-NTA Superflow resin (Qiagen). The buffer was then 
exchanged using a HiPrep 26/10 desalting column (GE Healthcare Biosciences) 
from a high-imidazole elution buffer to a low pH buffer (20 mM Bis-Tris pH 6.5, 
150 mM NaCl). Afterward, endoglycosidase H (EndoH) (10% w/w) and HRV3C 
protease (1% w/w) were added to the protein and the reaction was incubated over- 
night at 4°C. The digested protein was further purified using a Superose 6 16/70 
column (GE Healthcare Biosciences). 

The furin-cleaved HKU1S construct analysed by negative-stain EM was similar 
to the one described above except that it encoded residues 1-1249 and contained 
the wild-type RRKRR furin-recognition site. Expression and purification were also 
similar, except that a plasmid expressing furin was co-transfected into the FreeStyle 
293-F cells to ensure complete processing of the protein. 

Sample preparation for negative-stain electron microscopy. HKUI S proteins 
were placed directly onto 400 copper mesh grids and then stained with 1% uranyl 
formate. Tris-buffered saline (TBS) was used as buffer if dilution was necessary. 
Negative-stain electron microscopy data collection. Grids were loaded into a 
Tecnai T12 Spirit operating at 120 keV and imaged using a Tietz TemCam-F416 
CMOS at 52,000 x magnification at ~1.5 1m under focus. Micrographs were 
collected using Leginon*! and processed within Appion™. Particles were picked 
using a difference-of-Gaussians approach** and aligned using reference-free 2D 
classification employing iterative multivariate statistical analysis/multi-reference 
alignment (MRA/MSA) using a binning factor of 2 to remove amorphous parti- 
cles**. Particles in classes that did not represent views of HKU1 S proteins were 
discarded. ISAC* was used to generate a template stack from which initial 3D 
models were generated using the EMAN2 (ref. 36) procedure initialmodel.py. 3D 
models were refined using EMANI (ref. 37). 

Sample preparation for cryo-electron microscopy. Sample solution (3 11) was 
applied to the carbon face of a CF-2/2-4C C-Flat grid (Electron Microscopy 
Sciences, Protochips) that had been plasma cleaned for five seconds using a mix- 
ture of Ar/O, (Gatan Solarus 950 Plasma system). The grid was then manually 
blotted and immediately plunged into liquid ethane using a manual freeze plunger. 
Cryo-electron microscopy data collection. Movies were collected via the 
Leginon interface on a FEI Titan Krios operating at 300 keV mounted with a 
Gatan K2 direct-electron detector*!. Each movie was collected in counting mode 
at 22,500 x nominal magnification resulting in a calibrated pixel size of 1.31 A/pix 
at the object level. A dose rate of ~10 e~/((cam pix) x s) was used; exposure time 
was 200 ms per frame. The data collection resulted in a total of 1,049 movies con- 
taining 50 frames each. Total dose per movie was 57 e~/A”. Data were collected at 
1.0 to 3.544m under focus. 

Cryo-electron microscopy data processing. Frames in each movie were aligned”, 
and CTF estimation was carried out using CTFFIND3 (ref. 39). Particles were 
picked from a subset of the data employing a difference-of-Gaussians approach”? 
and aligned using reference-free 2D classification employing iterative MRA/MSA 
using a binning factor of two™*. The resulting 2,188 particles were used to generate 
an initial 25 A lowpass-filtered 3D reconstruction using EMAN2. SPIDER refproj. 
spi’? with a delta theta angle of 15 degrees was used to generate 83 projection 
images of the initial 3D reconstruction. These projection images were used as 
templates for picking particles from the entire cryo data set. Particles from the 
entire data set were aligned and classified with the same methods used for the 
subset of particles stated above. After 2D classification, unbinned selected particles 
were symmetrically refined in RELION version 1.3 (refs 41, 42) against the initial 
3D reconstruction filtered to 60 A resolution. This refinement was followed by 
particle polishing and refinement of the resulting realigned, B-factor-weighted 
and signal-integrated particles using RELION version 1.4b1. The resolution of the 
final map was 4.04 A at an FSC cutoff of 0.143. A mask was generated in RELION 
using a threshold that accounted for the entire structure. From this threshold, the 
mask was further dilated by 3 voxels and a Gaussian fall-off was generated over an 


additional 6 voxels. The mask effect on FSC was taken into consideration. Phases 
were randomized in the unfiltered half-set maps for initial FSC lower than 0.8 
and a new FSC between these phase-randomized maps was generated and used to 
correct for mask effects in the final FSC-based resolution estimate. The reported 
resolution of 4.04 A is the RELION CorrelationCorrected value 

The map was B-factor sharpened employing FSC-weighting. The B-factor 
was estimated in RELION based on the resolution range from 10 A to 2.62 A 
(B-factor = —117 A?). The detector MTF file was provided to RELION. 
Model building and refinement. An initial model of the S1 NTD was generated 
using the Modeller* homology modelling tool in UCSF Chimera with the BCoV 
NTD (PDB 4H14)*as a template. The NTD homology model was docked into the 
HKU1S protein EM density and refined with Rosetta density-guided iterative local 
refinement* while imposing C3 symmetry. Rosetta output models were clustered 
based on pairwise r.m.s.d. using a cluster radius of 2.15 A. The lowest energy model 
from the largest cluster was selected for additional refinement. This model and the 
conserved CTD core from SARS-CoV (PDB 2AJE)!! were used as starting struc- 
tures for model building and refinement. These starting models and the remaining 
HKU! protein sequence were modelled manually using COOT* and refined using 
RosettaRelax’”. Structures were evaluated using EMRinger*® and Molprobity”. 
Figures were produced in the PyMol*? or UCSF Chimera“ software packages. 
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Extended Data Figure 1 | Data processing flowchart. a, Processing resolution of 4.04 A is indicated in the plot. c, Angular distribution of 
resulting in density map of pre-fusion HKU1 spike glycoprotein at 4.04 A raw data within the data set. A slight, but within normal range, over- 
resolution. b, FSC plot illustrating correlation between two volumes representation of top views was observed (tall red bars). 


refined independently from two distinct half sets of raw data. A final 
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Extended Data Figure 2 | Resolution of the pre-fusion HKUI1 S density in stable internal secondary structures to greater than 5.00 A in flexible 


map. a, Local resolution within the EM density map. Local resolution peripheral loops. b, Close-ups of secondary-structure densities. To the 
was calculated using ResMap”! discretizing every 0.25 A over a range left is displayed the central «-helix of an $2 monomer and to the right is a 
from 2 x voxel size (2.62 A) to 4 x voxel size (5.24 A). Resolution 3-sheet from the NTD domain in an $1 monomer. 


significance criterion was set to 0.05. The resolution ranges from 3.74A 
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Wild-type S1/S2 Cleavage Site + - - 

Foldon trimerization domain + + - 
Extended Data Figure 3 | Cleavage at the $1/S2 junction does not spike 1-1276 with an attached foldon and a mutated furin-cleavage site 
induce large conformational changes in HKU1 spike. a, HKU1 spike reconstructed using negative-stain electron microscopy. c, HKU1 spike 
1-1249 with an attached foldon domain and wild-type furin-cleavage site 1-1249 without foldon and with mutated furin-cleavage site. Side and top 
was reconstructed using negative-stain electron microscopy. b, HKU1 views are shown. 
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receptor binding site 
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Extended Data Figure 4 | Putative glycan binding site of the HKU1 involved in the putative glycan-binding site (dashed circle) are shown 
$1 NTD. a, HKU1 trimeric S and b, an isolated monomer. Putative host as sticks, with oxygen atoms coloured red and nitrogen atoms coloured 
glycan-binding and protein-receptor-binding sites are indicated. c, The blue. Note that N198 (BCoV) and N188 (HKU1) are predicted N-linked 
bovine coronavirus (BCoV) $1 NTD structure from Peng et al.* (teal) glycosylation sites. 


is superposed onto the HKU1 S NTD (pink). Residue side-chains 
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Extended Data Figure 5 | Betacoronavirus S proteins possess a secondary structure: grey) and the insert which differs amongst 
conserved structural core in their C-terminal domains. a, The coronaviruses is coloured yellow. Atoms participating in quaternary 
structurally divergent loop of the $1 CTD is poorly ordered distal to the interactions with other HKU1 S protomer CTDs are shown in green 
core CTD domain. The conserved $1 CTD cores!” of b, HKU1-CoV surface in c. f, The positions of these interacting atoms are mapped on to 
highlighted in the trimeric pre-fusion S, c, HKU1-CoV as an isolated the conserved core topology. The sheet and helix nomenclature is taken 
domain, d, MERS-CoV’’ and e, SARS-CoV"! are coloured according from reference 10. 


to secondary structure (3-sheets: pink, a-helices: blue, lacking regular 
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Extended Data Figure 6 | Sequence alignment of human C-terminal domain (CTD) which contains the large variable loop, the 
betacoronavirus S proteins. Sequence alignment of S proteins from S1/S2 and $2’ cleavage sites, fusion peptide (FP), heptad repeats 1 and 2 
HKUI, SARS-CoV and MERS-CoV using Clustal Omega>’. Protein (HR1, HR2) and transmembrane helix (TM). 


features described in the text are indicated: N-terminal domain (NTD), 
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Extended Data Figure 7 | $1 sits atop an adjacent protomer’s $2. a, The 
HKU1 S1 subunits are rotated about the trimeric threefold axis relative to 
their corresponding S2 subunits such that the $1 CTD from one protomer 
caps the S2 central helix from an adjacent protomer (CTD, blue, caps 
$2, red). The third protomer of the trimer has been omitted for clarity. 

b, HKU1 S1 CTD (blue) uses a short helix to cap the central helix and 
HRI (red). c, The influenza haemagglutinin HA2 central helix (red) is 
also capped by a helix in HAI (blue)'*”®. d, The $2 N-terminal 8-strand 
is connected to the remainder of the $2 subunit via a loop and an a-helix 


(dotted lines). These regions of the EM density are of insufficient quality 
to confidently build this protein region but enable interpretation of 
connectivity. e, In the pre-fusion HKU1 S protein, the tops of the central 
S2 helices (blue, red, green) are splayed outwards from the threefold axis 
and capped by the S1 CTDs (white). The $1 NTD, SD-1 and SD-2 have 
been omitted for clarity. f, In the post-fusion six-helix-bundle structure of 
SARS S”, the corresponding helical regions from (e) form a well-packed 
three-helix bundle. 
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Extended Data Figure 8 | Class I viral fusion proteins. All class I fusion 
proteins require proteolytic cleavage adjacent to the fusion peptide or loop, 
and the metastable pre-fusion state is triggered by a series of events that 
involve pH change or receptor binding. The post-fusion conformations 

all contain anti-parallel six-helix bundles composed of the HR1 and 

HR2 from the membrane-proximal subunit. However, there is a great 
diversity in pre-fusion conformations as shown here. Members of this 

class that also participate in receptor binding!*"1»3 (top row), including 


HIV-1 Env 


Ebolavirus GP 


S glycoproteins of coronaviruses, are organized such that their receptor 
binding subunits sit atop the fusion machinery, and need to be shed in 
order for membrane fusion to proceed. Paramyxovirus F proteins**°” 
(bottom row) have a different architecture than the capped fusion proteins 
on the top row. The F proteins all have disulfide bonds between the 
membrane proximal and membrane distal subunits, and the two subunits 
remain interconnected throughout the rearrangement process. 
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b 
Extended Data Figure 9 | HKU1 S glycosylation. a, Sites of N-linked of density in the EM map is observed for 10 sites corresponding to the 
glycosylation on the HKU1 S trimer and b, a single monomer. Of the EndoH-trimmed sugars. Asparagines where glycan density is observed are 
30 potential N-linked glycosylation sites in a single protomer, the shown as magenta spheres. Asparagines lacking glycan density are shown 
asparagine residues are observed for 21 sites and of these a small portion in green. 
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Extended Data Table 1 | CryoEM data collection, processing and refinement metrics 


Data collection/processing 


Microscope Titan Krios 
Voltage (keV) 300 
Defocus range (um) IMtosS 
Movies 1,049 
Frames per movie 50 
Exposure time per frame (ms) 200 
Magnification 22,500x 
Dose rate (e7/pixel/s) 10 
Total dose per movie (e-/A?) 57 
Particles 31,435 
Map Resolution (A) 4.04 
Model Refinement 
Chimera CC*4 0.87 
EMRinger Score*® 27 
MolProbity*? 1.6 
Clashscore‘*? 3.0 
Ramachandran (%)*9 

Favored 92.1 

Allowed 7.0 

Outliers 0.9 


CC=cross correlation 
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